|
|
Accession Number |
TCMCG024C01731 |
gbkey |
CDS |
Protein Id |
XP_021970830.1 |
Location |
join(115330159..115330265,115331849..115331975,115332093..115332190,115332316..115332391,115332742..115332861,115333326..115333373,115333485..115333549,115333621..115333706,115333835..115333999,115334071..115334180,115334684..115334789,115334872..115334947,115336262..115336320,115336721..115336788,115336889..115337062,115338127..115338333,115338661..115338917,115339003..115339146,115339414..115339471,115340443..115340733,115341093..115341776,115341882..115342094) |
Gene |
LOC110865817 |
GeneID |
110865817 |
Organism |
Helianthus annuus |
|
|
Length |
1112aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA396063 |
db_source |
XM_022115138.2
|
Definition |
DNA mismatch repair protein MSH1, mitochondrial isoform X2 [Helianthus annuus] |
CDS: ATGTGTTGGGTGACGGCGAGGTCCCTGGCAATCTCTGCTCTTCACAGCTGCCCCCCTCTTCCTCTCCCTTACTTTTCTTCCTTTACTCTCCTCCTATGCTCCTCCCCGCAGCCGAAGCAGGTCTACTGCTTCAAAGAGAGAAGGTCTACAAGCACCAAATCAGCAAAAAAGATCAGGGAACTAAAAGATCTTCTTGTGGAGAAGGATTATCCTCATATCATGTGGTGGAAGGAGAAAATGCTGACATGTGTGAAGACATCATCTATCCAATTGGTAACGCGACTGGTTTACTCCAATCTGCTTGGTTTGGATGACAACCTCAAGAATGGGAGTCTAAAAGAAGGAACGCTTAATTGCGAAATACTGAAATTCAAGTCAAGGTTCCCTCGGGAAGTTTTACTTTGTAGGGTTGGGGACTTTTATGAAGCCATTGGGTTTGATGCGTGTATCCTTGTGGAATATGCTGGTTTGAACCCTTGTGGTGGTCTTCGTTCAGATAGCGTTCCTAAAGCTGGCTGCCCTGTTGTGAATTTACGTCAAACATTGGATGATCTGACACGTAACGGGTTTTCCGTATGCATTGTTGAAGAAGTTCAGGGTCCAACTCAAGCTCGTTCTCGCAAAAGCCGTTTTATATCTGGGCATGCACATCCCGGAAGTCCTTATGTGTTTGGACTCGTGGAAGATGATCGTGATCTTGAGTTTCCTGAACCGATGCCCGTTGTTGGAGTATCCCGTTCTGCCAAAGGGTATTGCATGGTTTCAGTTCTGGAGACTATGAAGACGTTTTCTTTAGAAGACGGCTTAACTGAAGAAGCATTAGTTACCAAGCTTCGTACTTGTCGTTATCATCATTTGTTTCTCCATAAATCACTTAAGAACAATTCTTCGGGGACTTCTAGTTGGAGAGAGTTTGGTGAAGGTGGGCTATTGTGGGCAGAATGCAATGGCAGACACTTTGAATGGCTTGAAGGAGACATGCTCGATGAGATCTTATTTAGGGTAAAAGAGCTATATGGCCTCGATGACAATGTCACATTTAGAAATGTAACCATTGCTTCAGAAAACAGGCCTCGTCCATTGCATCTTGGAACAGCCTCACAAATTGGTGCTATACAGACTGAGGGAATACCTTTTTTGTTAAAATATTTGCTCCCCTCGAATTGTACTGGACTACCTGCGATGTATGTCAGAGATCTTCTTCTAAATCCTCCTGCTTACGCAATTGCATCTACCATTCAAGACATCTGCAAACTTATGAGCAATGTTTCATGCACAATTCCCGAGTTCACTTGTATCACACCATCAAAGCTTGTAAAGTTACTTGAGTTAAGGGAAACAAATCATGTTGAGTTTTGCAAGATCAAAAGTGTGCTCGATGAGGTTTTACAGTTGTATAGTAACTCTGAACTCAATGAGATACTAAGACTATTAATGGATCCTACTTGGGTGGCAACGGGACTGAAAATCGACATGGAGACCCTAGTGAAAGAATGTGAATGTGTTTCACGTAGAATTAGTGAAATTATCTCTACATATGGCGAAAGTAATCAGAAAATGAGTTCCCATGTGAACATACCAAATGAATTCTTTGAGGAGATGGAGTCTTCATGGAAGGGCCGTGTCAAGAGGATCCATCTAATAGAAGCTTACGAGGAAGTTGATAAGGCTGCCGGAGCCTTATCTTTAGCCGTTACAGAAGATTTTCTTCCGGTAGTTACGAGAGTAAGAGCTACTACTGCATCATTTGGAGGTCCAAGGGGAGAAATCGCATACGCGCGGGAGCACAAAGCTATTTGGTTCAAAGGGAAACGGTTTACGCCAGCTGTTTGGGCCGGAACCCTAGGTGAAGAACAGATCAAACAACTTAGGCCATCTGTAGATGCAAGGGGTAGAAAAGTTGGGGAGGAATGGTTTACCACTGTGAAGGTGGAGGATGCACTCACAAGGTATCATGAGGCTTGTGCAAACGCAAAGACAATGGTCTTAGATTTGTTAAGGGAACTTTCTGCCGAATTGCAAGCTAAAGTTAATGTTCTCGTTTTCGCATCCATGTTGCTTGTTATTGCAAAAGCGTTATTTGCTCACGTGAGTGAGGGGAGAAGAAGGAAATGGGTGTTCCCTACTCTCATCGATTCGTCTGGTTCTCAGGAAAAGGGACGAACACACGAGATGCAGATTACAGGTCTATCACCATATTGGTTTGATGCAGCCGAAGGCAGTGGTGTGAGGAATACAATTGTTATGAAGTCGATGTTTCTTTTAACCGGACCAAATGGAGGTGGTAAATCAAGCTTGCTTCGTTCGATTTGTGCTGCAGCACTATTTGGAATTTGCGGGTTTATGGTCCCGGCTGAGTCTGCCACAATTCCTCAATTCGACTCTATTATGTTACACATGAAATCTTATGACAGCCCTGCAGATGGGAAGAGCTCATTTCAGATAGAAATGTCAGAGTTGCGGTCTATTATTGTGGGGGCCACTTCAAAGAGCCTTGTCCTTGTGGATGAGATATGTAGAGGAACCGAAACCGCAAAAGGGACATGCATTGCTGCTAGTATTGTGGAAACTCTCGACTCCATCGGCTGTTTGGGCATTGTATCAACTCACTTGCATGATATCTTCAAGCTACCACTAACCGCAACGAATACGGTATTTAAAGCAATGGGAAGTGAACACGTGGATGGTCAAACCAAACCCACATGGAAGTTGATTGATGGGATATGCAGGGAGAGTCTTGCGTTTGAAACGGCCAAGCGCGAAGGGGTTCCCGAAGCAATCATACAAAGAGCGGAAGAGCTATATATCTCAATGCACACAACGGATGAACACGATCTTTCTAATGAAAACGGATCTCATAAAACCAACAATCACCCAATTGTTAATGAAACACTGCCTAATCTTTCGTTTGTTGAATCGACTGTCCAAATGCAGAAGTTTTTGAAGGAAGTAGAAAGTACTGTCCGTATGATATGTCAGAGAGGGTTGATTGAGGTTTGCAAGATGAGAAGTACAGTAATAAGATGTGTTCTCATTGCTCCCCGGCAACAACCCCCGCCATCGGCAATTGGTGCATCAAGTGTTTATGTGATTCTTAGACCTGACAACAGACTTTATGTTGGGGAGACTGATGATTTGGAGGGACGGGTGCGTGCTCATCGATCAAAGCAGGGAATGCAAAATGCTTCTTTCTTATATTTCTTAGTTCCTGGGAAGAGCGTAGCTTGCCAGTTGGAAACTCTTCTGATTAACCAGCTACCGAAGCATGGGTTTCAACTCACGAATATAGCCGACGGTAAGCATAGGAACTTTGGTACATGTGTTTTCTCGACCGTGTAA |
Protein: MCWVTARSLAISALHSCPPLPLPYFSSFTLLLCSSPQPKQVYCFKERRSTSTKSAKKIRELKDLLVEKDYPHIMWWKEKMLTCVKTSSIQLVTRLVYSNLLGLDDNLKNGSLKEGTLNCEILKFKSRFPREVLLCRVGDFYEAIGFDACILVEYAGLNPCGGLRSDSVPKAGCPVVNLRQTLDDLTRNGFSVCIVEEVQGPTQARSRKSRFISGHAHPGSPYVFGLVEDDRDLEFPEPMPVVGVSRSAKGYCMVSVLETMKTFSLEDGLTEEALVTKLRTCRYHHLFLHKSLKNNSSGTSSWREFGEGGLLWAECNGRHFEWLEGDMLDEILFRVKELYGLDDNVTFRNVTIASENRPRPLHLGTASQIGAIQTEGIPFLLKYLLPSNCTGLPAMYVRDLLLNPPAYAIASTIQDICKLMSNVSCTIPEFTCITPSKLVKLLELRETNHVEFCKIKSVLDEVLQLYSNSELNEILRLLMDPTWVATGLKIDMETLVKECECVSRRISEIISTYGESNQKMSSHVNIPNEFFEEMESSWKGRVKRIHLIEAYEEVDKAAGALSLAVTEDFLPVVTRVRATTASFGGPRGEIAYAREHKAIWFKGKRFTPAVWAGTLGEEQIKQLRPSVDARGRKVGEEWFTTVKVEDALTRYHEACANAKTMVLDLLRELSAELQAKVNVLVFASMLLVIAKALFAHVSEGRRRKWVFPTLIDSSGSQEKGRTHEMQITGLSPYWFDAAEGSGVRNTIVMKSMFLLTGPNGGGKSSLLRSICAAALFGICGFMVPAESATIPQFDSIMLHMKSYDSPADGKSSFQIEMSELRSIIVGATSKSLVLVDEICRGTETAKGTCIAASIVETLDSIGCLGIVSTHLHDIFKLPLTATNTVFKAMGSEHVDGQTKPTWKLIDGICRESLAFETAKREGVPEAIIQRAEELYISMHTTDEHDLSNENGSHKTNNHPIVNETLPNLSFVESTVQMQKFLKEVESTVRMICQRGLIEVCKMRSTVIRCVLIAPRQQPPPSAIGASSVYVILRPDNRLYVGETDDLEGRVRAHRSKQGMQNASFLYFLVPGKSVACQLETLLINQLPKHGFQLTNIADGKHRNFGTCVFSTV |